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Amendments to the Claims: 

This listing of claims will replace all prior versions, and listings, of claims in the application: 
Listing of Claims: 

1 . (Currently Amended) A program parallelization device comprising: 

a control/data flow analysis unit which analyzes the control flow and the data flow of 
a sequential processing program; 

a fork point candidate determination unit which determines the fork point candidates 
of the sequential processing program by referring to the results of the analysis of the control 
flow and the data flow by said control/data flow analysis unit; 

a parallel execution performance evaluation unit which evaluates, with respect to an 
input data, a parallel execution performance when the sequential processing program has been 
parallelized by a test combination of fork point candidates that were given; 

a best fork point candidate combination determination unit which generates a test 
combination of the fork point candidates that were determined by said fork point candidate 
determination unit, provides the test combination to said parallel execution performance 
evaluation unit, and by taking the parallel execution performance of the test fork point 
candidate combination evaluated thereby as the reference, determines the best fork point 
candidate combination; and 

a parallelized program output unit which generates and outputs a parallelized program 
by inserting a fork command at each fork point candidate of the best combination determined 
by said best fork point candidate combination determination unit A 

wherein the program parallelization device transforms the sequential processing 
program into the parallelized program . 

2. (Currently Amended) A program parallelization device comprising: 

a control/data flow analysis unit which analyzes the control flow and the data flow of 
a sequential processing program; 
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a fork point candidate determination unit which determines the fork point candidates 
of the sequential processing program by referring to the results of the analysis of the control 
flow and the data flow by said control/data flow analysis unit; 

a parallel execution performance evaluation unit which evaluates, with respect to an 
input data, a parallel execution performance when the sequential processing program has been 
parallelized by a test combination of fork point candidates that were given; 

a best fork point candidate combination determination unit which generates a test 
combination only consisting of the combination of fork point candidates that can be 
simultaneously executed in the one-time fork model from the fork point candidates 
determined by said fork point candidate determination unit, provides the test combination to 
said parallel execution performance evaluation unit, and by taking the parallel execution 
performance of the test fork point candidate combination evaluated thereby as the reference, 
determines the best fork point candidate combination; 

a parallelized program output unit which generates and outputs a parallelized program 
by inserting a fork command at each fork point candidate of the best combination determined 
by said best fork point candidate combination determination unit, 

wherein the program parallelization device transforms the sequential processing 
program into the parallelized program . 

3. (Original) The program parallelization device as set forth in claim 1, 
wherein said parallel execution performance evaluation unit 

generates a sequential execution trace when the sequential processing program was 
sequentially executed with the input data, divides the sequential execution trace by taking all 
the terminal point candidates as division points, analyzes thread element information for each 
thread element, and simulates parallel execution by units of thread element with respect to the 
test combination of the fork point candidates that were given to calculate the parallel 
execution performance. 

4. (Original) The program parallelization device as set forth in claim 1, 
wherein said best fork point candidate combination determination unit 
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constructs a better combination by ranking the fork point candidates determined by 
said fork candidate determination unit in the order in which the fork point candidates are 
predicted to have an influence on parallel execution performance, and evaluating the parallel 
execution performance according to this order by taking the best fork point candidate 
combination at that time as the reference. 

5. (Original) The program parallelization device as set forth in claim 4, 

wherein said best fork point candidate combination determination unit 

assuming that the combination of the fork point candidates including the prescribed 
numbers from the top in the order of the fork point candidates determined is an initial 
combination, evaluates the parallel execution performance of the initial combination with said 
parallel execution performance evaluation unit, and sets the initial combination to the best 
fork point candidate combination at this time. 

6. (Original) The program parallelization device as set forth in claim 1, 

wherein said best fork point candidate combination determination unit 

divides said collection of all the fork point candidates that have been determined by 
said fork point candidate determination unit into fork point candidate groups in such a way 
that the fork point candidates have as little effects as possible on each other, generates a test 
fork point candidate combination for a group in the divided fork point candidate groups in 
which the best fork point candidate combination determination processing has not been 
performed, performs the best fork point candidate combination determination processing that 
determines the best fork point candidate combination by referring to the result of parallel 
execution performance of the test fork point candidate combination evaluated by said parallel 
execution performance evaluation unit, and determines the sum of the best fork point 
candidate combinations, which are the processing results for each group, as the overall 
processing result. 

7. (Original) The program parallelization device as set forth in claim 6, 
wherein said best fork point candidate combination determination unit 
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calls the fork point candidate group partition processing taking the collection of all the 
fork point candidates determined by said fork point candidate determination unit as a 
collection after the processing of said fork point candidate determination unit has been 
completed, when the fork point candidate group partition processing is called, starts the group 
partition processing of the collection if the number of fork point candidates belonging to the 
given fork point candidate collection is higher than the designated division number lower 
limit, returns to the origin from where the fork point candidate group partition processing was 
called without performing the group partition processing if the number of the fork point 
candidates is lower, divides from the collection the fork point candidate collections in which 
the number of fork point candidates that cancel themselves is higher than the designated 
number to generate a new group, further divides the collection into two groups, recursively 
calls the fork point candidate group partition processing taking one group as a collection and 
performs group partitioning of the group, recursively calls the fork point candidate group 
partitioning process taking the other group as a collection and performs group partitioning of 
the group, returns to the origin from where the fork point .candidate group partitioning process 
was called, performs the best fork point candidate combination determination processing for 
the groups in which the best fork point candidate combination determination processing has 
not been performed among the groups of fork point candidates that were divided, determines 
whether the processing of all the groups has been completed, if there is a group that has not 
been processed, reiterates the best fork point candidate combination determination processing 
for the groups that have not been processed, and, when the processing of all the groups has 
been completed, outputs the sum of the fork point candidate combination, which is the result 
of processing for each group, as the overall result. 

8. (Currently Amended) A program parallelization method for a multithreading 
method in which a sequential processing program is divided into a plurality of threads and a 
plurality of processors execute the threads in parallel, comprising the steps of: 

analyzing the control flow and the data flow of a sequential processing program; 

determining the fork point candidates of the sequential processing program by 
referring to the results of the analysis of the control flow and the data flow; 
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generating a test fork point candidate combination from the determined fork point 
candidates; 

evaluating, with respect to an input data, the parallel execution performance when the 
sequential processing program has been parallelized by the generated test fork point candidate 
combination; 

determining a best fork point candidate combination by taking the parallel execution 
performance of the evaluated test fork point candidate combination as the reference; inserting 
a fork command at each fork point candidate in the determined best combination to generate 
and output a parallelized program,, 

wherein the sequential processing program is transformed into the parallelized 
program . 

9. (Currently Amended) A program parallelization method for a multithreading 
method in which a sequential processing program is divided into a plurality of threads and a 
plurality of processors execute the threads in parallel, comprising the steps of: 

analyzing the control flow and the data flow of a sequential processing program; 

determining the fork point candidates of the sequential processing program by 
referring to the results of the analysis of the control flow and the data flow; 

generating a test combination only consisting of the fork point candidates that can be 
simultaneously executed in the one-time fork model from the determined fork point 
candidates; 

evaluating, with respect to an input data, the parallel execution performance when the 
sequential processing program has been parallelized by the generated test fork point candidate 
combination; 

determining a best fork point candidate combination by taking the parallel execution 
performance of the evaluated test fork point candidate combination as the reference; 

inserting a fork command at each fork point candidate in the determined best 
combination to generate and output a parallelized program,. 
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wherein the sequential processing program is transformed into the parallelized 
program . 

10. (Currently Amended) A program parallelization method for a multithreading 
method in which a sequential processing program is divided into a plurality of threads and a 
plurality of processors execute the threads in parallel, comprising: 

a step in which a control/data flow analysis unit analyzes the control flow and the data 
flow of a sequential processing program; 

a step in which a fork point candidate determination unit generates the fork point 
candidates by referring to the results of the analysis of the control flow and the data flow by 
the control/data flow analysis unit; 

a step in which a best fork point candidate combination determination unit predicts the 
effect of each of all the fork point candidates on the parallel execution performance and ranks 
the fork point candidates in the order of the effect; 

a step in which the best fork point candidate combination determination unit generates 
an initial fork point candidate combination whose parallel execution performance is evaluated 
first, and which is assumed to be the best fork point candidate combination; 

a step in which a parallel execution performance evaluation unit generates a sequential 
execution trace when the sequential processing program was sequentially executed with the 
input data; 

a step in which the parallel execution performance evaluation unit divides the 
sequential execution trace by taking all the terminal point candidates as division points 

a step in which the parallel execution performance evaluation unit analyzes thread 
element information for each thread element, and memorizes the thread element information 
for each thread element. 

a step in which a best fork point candidate combination determination unit selects one 
fork point candidate that is ranked highest order among the non-selected fork point 
candidates; 
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a step in which the best fork point candidate combination determination unit assesses 
whether the fork point candidate selected is contained in the best fork point candidate 
combination; 

a step in which, if the selected fork point candidate is not contained in the best fork 
point candidate combination, the best fork point candidate combination determination unit 
adds the selected fork point candidate to the best fork point candidate combination, and sets 
the fork point candidate combination as a test combination; 

a step in which, if the selected fork point candidate is contained in the best fork point 
candidate combination, the best fork point candidate combination determination unit removes 
the selected fork point candidate from the best fork point candidate combination, and sets the 
fork point candidate combination as the test combination; 

a step in which the best fork point candidate combination determination unit evaluates 
the parallel execution performance of parallelization by the test combination through the 
parallel execution performance evaluation unit; 

a step in which the best fork point candidate combination determination unit compares 
the parallel execution performance of the test combination with the parallel execution 
performance of the best combination; 

a step in which, if the parallel execution performance of the test combination is better, 
the best fork point candidate combination determination unit sets the test combination as the 
best fork point candidate combination at the current time; 

a step in which the best fork point candidate combination determination unit assesses 
whether a fork point candidate that has not been selected exists, and if a fork point candidate 
that has not been selected exists, reiterates execution; 

a step in which, if a non-selected fork point candidate does not exist, the best fork 
point candidate combination determination unit assesses whether a new best fork point 
candidate combination is found in the previous iterative execution; 
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a step in which, if a new best fork point candidate combination is found, the best fork 
point candidate combination determination unit sets all the fork point candidates to the non- 
selected state for the iterative execution; 

a step in which, if a new best fork point candidate combination is not found, the best 
fork point candidate combination determination unit outputs the determined best fork point 
candidate combination as the result of the best fork point candidate combination 
determination processing; and 

a step in which a parallelized program output unit generates and outputs the 
parallelized program by inserting a fork command at each fork point candidate of the best 
combination determined by the best fork point candidate combination determination unit a 

wherein the sequential processing program is transformed into the parallelized 
program . 

1 1 . (Currently Amended) A program parallelization method for a multithreading 
method in which a sequential processing program is divided into a plurality of threads and a 
plurality of processors execute the threads in parallel, comprising: 

a step in which a control/data flow analysis unit analyzes the control flow and the data 
flow of a sequential processing program; 

a step in which a fork point candidate determination unit generates the fork point 
candidates by referring to the results of the analysis of the control flow and the data flow by 
the control/data flow analysis unit; 

a step in which a best fork point candidate combination determination unit predicts the 
effect of each of all the fork point candidates on the parallel execution performance and ranks 
the fork point candidates in the order of the effect; 

a step in which the best fork point candidate combination determination unit generates 
an initial fork point candidate combination whose parallel execution performance is evaluated 
first, and which is assumed to be the best fork point candidate combination; 

a step in which a parallel execution performance evaluation unit generates a sequential 
execution trace when the sequential processing program was sequentially executed with the 
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input data; a step in which the parallel execution performance evaluation unit divides the 
sequential execution trace by taking all the terminal point candidates as division points; 

a step in which the parallel execution performance evaluation unit analyzes thread 
element information for each thread element, and memorizes the thread element information 
for each thread element; 

a step in which a best fork point candidate combination determination unit selects one 
fork point candidate that is ranked highest order among the non-selected fork point 
candidates; 

a step in which the best fork point candidate combination determination unit assesses 
whether the selected fork point candidate is contained in the best fork point candidate 
combination; 

a step in which, if the selected fork point candidate is not contained in the best fork 
point candidate combination, the best fork point candidate combination determination unit 
adds the selected fork point candidate to the best fork point candidate combination, and sets 
the fork point candidate combination as a test combination; a step in which the best fork point 
candidate combination determination unit removes the fork point candidate that cancels the 
selected fork point candidate from the test combination; 

a step in which, if the selected fork point candidate is contained in the best fork point 
candidate combination, the best fork point candidate combination determination unit removes 
the selected fork point candidate from the best fork point candidate combination, and sets the 
fork point candidate combination as the test combination; 

a step in which the best fork point candidate combination determination unit evaluates 
the parallel execution performance of parallelization by the test combination through the 
parallel execution performance evaluation unit; 

a step in which the best fork point candidate combination determination unit compares 
the parallel execution performance of the test combination with the parallel execution 
performance of the best combination; 
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a step in which, if the parallel execution performance of the test combination is better, 
the best fork point candidate combination determination unit removes the fork point candidate 
that is canceled by the selected fork point candidate from the test combination; 

a step in which the best fork point candidate combination determination unit sets the 
test combination as the best fork point candidate combination at the current time; 

a step in which the best fork point candidate combination determination unit assesses 
whether a fork point candidate that has not been selected exists, and if a fork point candidate 
that has not been selected exists, reiterates execution; 

a step in which, if a non-selected fork point candidate does not exist, the best fork 
point candidate combination determination unit assesses whether a new best fork point 
candidate combination is found in the previous iterative execution; 

a step in which, if a new best fork point candidate combination is found, the best fork 
point candidate combination determination unit sets all the fork point candidates to the non- 
selected state for the iterative execution; 

a step in which, if a new best fork point candidate combination is not found, the best 
fork point candidate combination determination unit outputs the determined best fork point 
candidate combination to the parallelized program output unit as the result of the best fork 
point candidate combination determination processing; and 

a step in which the parallelized program output unit generates and outputs the 
parallelized program by inserting a fork command at each fork point candidate of the best 
combination determined by the best fork point candidate combination determination unit, 

wherein the sequential processing program is transformed into the parallelized 
program . 

12. (Original) The program parallelization method as set forth in claim 8, 

wherein the step of evaluating said parallel execution performance 

generates a sequential execution trace when the sequential processing program was 
sequentially executed with the input data, divides the sequential execution trace by taking all 
the terminal point candidates as division points, analyzes thread element information for each 
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thread element, and simulates parallel execution by units of thread element with respect to the 
test combination of the fork point candidates that were given to calculate the parallel 
execution performance. 

13. (Original) The program parallelization method as set forth in claim 8, 

wherein the step of determining the best combination of said fork point candidates 

constructs a better combination by ranking the fork point candidates determined by 
said fork candidate determination unit in the order in which the fork point candidates are 
predicted to have an influence on parallel execution performance, and evaluating the parallel 
execution performance according to the order by taking the best fork point candidate 
combination at that time as the reference. 

14. (Original) The program parallelization method as set forth in claim 8, 

wherein the step of determining the best combination of said fork point candidates 

divides the collection of all the fork point candidates into fork point candidate groups 
in such a way that the fork point candidates have as little effects as possible on each other, 
generates a test fork point candidate combination for a group in the divided fork point 
candidate groups in which the best fork point candidate combination determination 
processing has not been performed, performs the best fork point candidate combination 
determination processing that determines the best fork point candidate combination by 
referring to the result of parallel execution performance of the test fork point candidate 
combination evaluated with respect to an input data, and determines the sum of the best fork 
point candidate combinations, which are the processing results for each group, as the overall 
processing result. 

15. (Original) The program parallelization method as set forth in claim 8, 

wherein the step of determining the best combination of said fork point candidates 

calls the fork point candidate group partition processing taking the collection of all the 
fork point candidates determined by the fork point candidate determination unit as a 
collection after the processing of said fork point candidate determination unit has been 
completed, when the fork point candidate group partition processing is called, starts the group 
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partition processing of the collection if the number of fork point candidates belonging to the 
given fork point candidate collection is higher than the designated division number lower 
limit, returns to the origin from where the fork point candidate group partition processing was 
called without performing the group partition processing if the number of the fork point 
candidates is lower, divides from the collection the fork point candidate collections in which 
the number of fork point candidates that cancel themselves is higher than the designated 
number to generate a new group, further divides the collection into two groups, recursively 
calls the fork point candidate group partition processing taking one group as a collection and 
performs group partitioning of the group, recursively calls the fork point candidate group 
partitioning process taking the other group as a collection and performs group partitioning of 
the group, returns to the origin from where the fork point candidate group partitioning process 
was called, performs the best fork point candidate combination determination processing for 
the groups in which the best fork point candidate combination determination processing has 
not been performed among the groups of fork point candidates that were divided, determines 
whether the processing of all the groups has been completed, if there is a group that has not 
been processed, reiterates the best fork point candidate combination determination processing 
for the groups that have not been processed, and, when the processing of all the groups has 
been completed, outputs the sum of the fork point candidate combination, which is the result 
of processing for each group, as the overall result. 

16. (Currently Amended) A computer readable medium embodying a computer 
program parallelization program that is executed on a computer causing, 

a computer to operate as 

a control/data flow analysis function which analyzes the control flow and the data 
flow of a sequential processing program; 

a fork point candidate determination function which determines the fork point 
candidates of the sequential processing program by referring to the results of the analysis of 
the control flow and the data flow by said control/data flow analysis function; 
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a parallel execution performance evaluation function which evaluates, with respect to 
an input data, a parallel execution performance when the sequential processing program has 
been parallelized by a test combination of fork point candidates that were given; 

a best fork point candidate combination determination function which generates a test 
combination of the fork point candidates that were determined by said fork point candidate 
determination function, provides the test combination to said parallel execution performance 
evaluation function, and by taking the parallel execution performance of the test fork point 
candidate combination evaluated thereby as the reference determines the best fork point 
candidate combination; and 

a parallelized program output function which generates and outputs a parallelized 
program by inserting a fork command at each fork point candidate of the best combination 
determined by said best fork point candidate combination determination function. 

17. (Currently Amended) A computer readable medium embodying a computer 
program parallelization program that is executed on a computer causing, 

a computer to operate as 

a control/data flow analysis function which analyzes the control flow and the data 
flow of a sequential processing program; 

a fork point candidate determination function which determines the fork point 
candidates of the sequential processing program by referring to the results of the analysis of 
the control flow and the data flow by said control/data flow analysis function; 

a parallel execution performance evaluation function which evaluates, with respect to 
an input data, a parallel execution performance when the sequential processing program has 
been parallelized by a test combination of fork point candidates that were given; 

a best fork point candidate combination determination function which generates a test 
combination only consisting of the combination of fork point candidates that can be 
simultaneously executed in the one-time fork model from the fork point candidates 
determined by said fork point candidate determination function, provides the test combination 
to said parallel execution performance evaluation function, and by taking the parallel 
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execution performance of the test fork point candidate combination evaluated thereby as the 
reference, determines the best fork point candidate combination; 

a parallelized program output function which generates and outputs a parallelized 
program by inserting a fork command at each fork point candidate of the best combination 
determined by said best fork point candidate combination determination function. 

18. (Currently Amended) The program parall e lization program computer readable 
medium as set forth in claim 16, 

wherein said parallel execution performance evaluation function 

generates a sequential execution trace when the sequential processing program was 
sequentially executed with the input data, divides the sequential execution trace by taking all 
the terminal point candidates as division points, analyzes thread element information for each 
thread element, and simulates parallel execution by units of thread element with respect to the 
test combination of the fork point candidates that were given to calculate the parallel 
execution performance. 

19. (Currently Amended) The program parall e lization program computer readable 
medium as set forth in claim 16, 

wherein said best fork point candidate combination determination function 

constructs a better combination by ranking the fork point candidates determined by 
said fork candidate determination function in the order in which the fork point candidates are 
predicted to have an influence on parallel execution performance, and evaluating the parallel 
execution performance according to the order by taking the best fork point candidate 
combination at that time as the reference. 

20. (Currently Amended) The program parall e lization program computer readable 
medium as set forth in claim 1 9, 

wherein said best fork point candidate combination determination function 

assuming that the combination of the fork point candidates including the prescribed 
numbers from the top in the order of the fork point candidates determined is an initial 
combination, evaluates the parallel execution performance of the initial combination with said 
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parallel execution performance evaluation function, and sets the initial combination to the 
best fork point candidate combination at this time. 

2 1 . (Currently Amended) The program parallolization program computer readable 
medium as set forth in claim 16, 

wherein said best fork point candidate combination determination function 

divides the collection of all the fork point candidates that have been determined by 
said fork point candidate determination function into fork point candidate groups in such a 
way that the fork point candidates have as little effects as possible on each other, generates a 
test fork point candidate combination for a group in the divided fork point candidate groups 
in which the best fork point candidate combination determination processing has not been 
performed, performs the best fork point candidate combination determination processing that 
determines the best fork point candidate combination by referring to the result of parallel 
execution performance of the test fork point candidate combination evaluated by said parallel 
execution performance evaluation function, and determines the sum of the best fork point 
candidate combinations, which are the processing results for each group, as the overall 
processing result. 

22. (Currently Amended) The program parall e lization program computer readable 
medium as set forth in claim 2 1 , 

wherein said best fork point candidate combination determination function 

calls the fork point candidate group partition processing taking the collection of all the 
fork point candidates determined by said fork point candidate determination function as a 
collection after the processing of said fork point candidate determination function has been 
completed, when the fork point candidate group partition processing is called, starts the group 
partition processing of the collection if the number of fork point candidates belonging to the 
given fork point candidate collection is higher than the designated division number lower 
limit, returns to the origin from where the fork point candidate group partition processing was 
called without performing the group partition processing if the number of the fork point 
candidates is lower, divides from the collection the fork point candidate collections in which 
the number of fork point candidates that cancel themselves is higher than the designated 

-16- 

WASH 4488777.1 



Atty. Dkt. No. 040405-0368 



number to generate a new group, further divides the collection into two groups, recursively 
calls the fork point candidate group partition processing taking one group as a collection and 
performs group partitioning of the group, recursively calls the fork point candidate group 
partitioning process taking the other group as a collection and performs group partitioning of 
the group, returns to the origin from where the fork point candidate group partitioning process 
was called, performs the best fork point candidate combination determination processing for 
the groups in which the best fork point candidate combination determination processing has 
not been performed among the groups of fork point candidates that were divided, determines 
whether the processing of all the groups has been completed, if there is a group that has not 
been processed, reiterates the best fork point candidate combination determination processing 
for the groups that have not been processed, and, when the processing of all the groups has 
been completed, outputs the sum of the fork point candidate combination, which is the result 
of processing for each group, as the overall result. 
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